Model Selection

Off-Policy Learning

# Off-Policy Learning

Tqc PandaPickAndPlace V1

This is a deep reinforcement learning model based on the TQC algorithm, specifically designed for the PandaPickAndPlace-v1 environment, used for robotic arm grasping and placing tasks.

Molecular Model

Sac Walker2d V3

This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.

Dqn MountainCar V0

This is a DQN agent model trained using stable-baselines3, specifically designed to solve reinforcement learning tasks in the MountainCar-v0 environment.

Molecular Model

Decision Transformer Gym Hopper Medium

This is a decision transformer model trained on medium-performance trajectories in the Gym Hopper environment, suitable for continuous control tasks.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase